Exploiting textual queries for dynamically visual disambiguation

Authors

Abstract

Due to the high cost of manual annotation, learning directly from the web has attracted broad attention. One issue that limits the performance of current webly supervised models is the problem of visual polysemy. In this work, we present a novel framework that resolves visual polysemy by dynamically matching candidate text queries with retrieved images. Specifically, our proposed framework includes three major steps: we first discover and then dynamically select the text queries according to the keyword-based image search results; we then employ the proposed saliency-guided deep multi-instance learning (MIL) network to remove outliers and learn classification models for visual disambiguation. Compared to existing methods, our proposed approach can figure out the right visual senses, adapt to dynamic changes in the search results, remove outliers, and jointly learn the classification models. Extensive experiments and ablation studies on the CMU-Poly-30 and MIT-ISD datasets demonstrate the effectiveness of our proposed approach.
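
The outlier-removal idea behind multi-instance learning can be illustrated with a minimal sketch. It is not the saliency-guided deep MIL network from the paper: it assumes each retrieved image is a "bag" of region scores that some upstream model has already produced, and it uses plain max pooling as the bag-level aggregation. The names `bag_score` and `filter_outliers`, the sample scores, and the 0.5 threshold are all hypothetical.

```python
from typing import Dict, List


def bag_score(instance_scores: List[float]) -> float:
    """Max-pooling MIL aggregation: a bag is as positive as its best instance."""
    if not instance_scores:
        raise ValueError("a bag needs at least one instance")
    return max(instance_scores)


def filter_outliers(
    bags: Dict[str, List[float]], threshold: float = 0.5
) -> Dict[str, float]:
    """Keep only the bags (images) whose aggregated score reaches the threshold.

    The threshold is an illustrative assumption, not a value from the paper.
    """
    kept: Dict[str, float] = {}
    for image_id, scores in bags.items():
        score = bag_score(scores)
        if score >= threshold:
            kept[image_id] = score
    return kept


# Hypothetical per-region scores for three retrieved images.
bags = {
    "img_a": [0.1, 0.9, 0.3],  # one strongly matching region
    "img_b": [0.2, 0.1],       # no matching region: treated as an outlier
    "img_c": [0.6, 0.4],
}
kept = filter_outliers(bags, threshold=0.5)
```

Under the max-pooling assumption, an image is kept as soon as one of its regions matches the target sense, which is why `img_b` is the only bag dropped here.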

Similar resources

Exploiting Visual Perception for Sampling-Based Approximation on Aggregate Queries

Efficient sampling algorithms have been developed for approximating answers to aggregate queries on large data sets. In some formulations of the problem, concentration inequalities (such as Hoeffding’s inequality) are used to estimate the confidence interval for an approximated aggregated value. Samples are usually chosen until the confidence interval is arbitrarily small enough regardless of h...

Wikimantic: Disambiguation for Short Queries

This paper presents an implemented and evaluated methodology for disambiguating terms in search queries. By exploiting Wikipedia articles and their reference relations, our method is able to disambiguate terms in particularly short queries with few context words. This work is part of a larger project to retrieve information graphics in response to user queries.

Automatic Image Annotation Exploiting Textual and Visual Saliency

Automatic image annotation is an attractive service for users and administrators of online photo sharing websites. In this paper, we propose an image annotation approach exploiting visual and textual saliency. For textual saliency, a concept graph is first established based on the associations between the labels. Then semantic communities and latent textual saliency are detected; for visual sa...

Hybrid-LSH for Spatio-Textual Similarity Queries

Locality Sensitive Hashing (LSH) is a popular method for high-dimensional indexing and search over large datasets. However, little effort has been put toward utilizing LSH in mobile applications for processing spatio-textual similarity queries, such as "find nearby shopping centers that have a top-ranked hair salon". In this paper, we present hybrid-LSH, a new LSH method for indexing data object...

Topic Level Disambiguation for Weak Queries

Despite limited success, today’s information retrieval (IR) systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries). Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queri...

Journal

Journal title: Pattern Recognition

Year: 2021

ISSN: 1873-5142, 0031-3203

DOI: https://doi.org/10.1016/j.patcog.2020.107620